THE SET CHROMATIC NUMBER OF RANDOM GRAPHS


ANDRZEJ DUDEK, DIETER MITSCHE, AND PAWEŁ PRAŁAT


Abstract. In this paper we study the set chromatic number of a random graph
$\mathcal{G}(n,p)$ for a wide range of $p = p(n)$. We show that the set chromatic number, as a
function of $p$, forms an intriguing zigzag shape.


1. Introduction 

A proper colouring of a graph is a labeling of its vertices with colours such that
no two vertices sharing the same edge have the same colour. A colouring using at most
$k$ colours is called a proper $k$-colouring. The smallest number of colours needed to
colour a graph $G$ is called its chromatic number, and it is denoted by $\chi(G)$.

In this paper we are concerned with another notion of colouring, first introduced by
Chartrand et al. [1]. For a given (not necessarily proper) $k$-colouring $c : V \to [k]$ of the
vertex set of $G = (V, E)$, let

\[ C(v) = \{ c(u) : uv \in E \} \]

be the neighbourhood colour set of a vertex $v$. (In this paper, $[k] := \{1, 2, \ldots, k\}$.)
The colouring $c$ is a set colouring if $C(u) \neq C(v)$ for every pair of adjacent vertices
in $G$. The minimum number of colours, $k$, required for such a colouring is the set
chromatic number $\chi_s(G)$ of $G$. One can show that

\[ \log_2 \chi(G) + 1 \le \chi_s(G) \le \chi(G). \tag{1} \]

Indeed, the upper bound is trivial, since any proper colouring $c$ is also a set colouring:
for any edge $uv$, $C(u)$, the neighbourhood colour set of $u$, contains $c(v)$ whereas $C(v)$ does not.
On the other hand, suppose that there is a set colouring using at most $k$ colours.
Since there are at most $2^k$ possible neighbourhood colour sets, one can assign a unique
colour to each set obtaining a proper colouring using at most $2^k$ colours. We get that
$\chi(G) \le 2^{\chi_s(G)}$, or equivalently, $\chi_s(G) \ge \log_2 \chi(G)$. With slightly more work, one can
improve this lower bound by 1 (see 0), which is tight (see 0). 
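
To make the definitions concrete, the following minimal Python sketch computes neighbourhood colour sets and tests whether a colouring is a set colouring. The function names and the adjacency-dictionary representation are illustrative assumptions, not part of the paper. On the path a-b-c, the colouring that gives colour 1 to both a and b and colour 2 to c is not proper, yet it is a set colouring.

```python
def neighbourhood_colour_sets(adj, colour):
    """Return C(v) = {c(u) : uv in E} for every vertex v."""
    return {v: frozenset(colour[u] for u in adj[v]) for v in adj}


def is_set_colouring(adj, colour):
    """True if C(u) != C(v) for every edge uv."""
    sets = neighbourhood_colour_sets(adj, colour)
    return all(sets[u] != sets[v] for u in adj for v in adj[u])


def is_proper_colouring(adj, colour):
    """True if c(u) != c(v) for every edge uv."""
    return all(colour[u] != colour[v] for u in adj for v in adj[u])


# Hypothetical example: the path a - b - c as an adjacency dictionary.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b"}}
colouring = {"a": 1, "b": 1, "c": 2}
```

Here `is_set_colouring(path, colouring)` holds while `is_proper_colouring(path, colouring)` does not, which illustrates why the set chromatic number can be smaller than the chromatic number.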

Let us recall a classic model of random graphs that we study in this paper. The
binomial random graph $\mathcal{G}(n,p)$ is the random graph $G$ with vertex set $[n]$ in which
every pair $\{i,j\} \in \binom{[n]}{2}$ appears independently as an edge in $G$ with probability $p$. Note
that $p = p(n)$ may (and usually does) tend to zero as $n$ tends to infinity.

FIGURE 1. The function $r = r(p)$ for $p \in (0,1)$ and $p \in (0,1/2]$, respectively.


All asymptotics throughout are as $n \to \infty$ (we emphasize that the notations $o(\cdot)$ and
$O(\cdot)$ refer to functions of $n$, not necessarily positive, whose growth is bounded). We say
that an event in a probability space holds asymptotically almost surely (or a.a.s.)
if the probability that it holds tends to 1 as $n$ goes to infinity. Since we aim for results
that hold a.a.s., we will always assume that $n$ is large enough. We often write $\mathcal{G}(n,p)$
when we mean a graph drawn from the distribution $\mathcal{G}(n,p)$. For simplicity, we will write
$f(n) \sim g(n)$ if $f(n)/g(n) \to 1$ as $n \to \infty$ (that is, when $f(n) = (1 + o(1))g(n)$). Finally,
we use lg to denote logarithms with base 2 and log to denote natural logarithms.


Before we state the main result of this paper, we need a few definitions that we will
keep using throughout the whole paper. For a given $p = p(n)$ satisfying

\[ p \ge \frac{4}{\log 2} \cdot \frac{(\log n)(\log \log n)}{n} \quad \text{and} \quad p \le 1 - \varepsilon \]

for some $\varepsilon > 0$, let

\[ s = s(p) = \min \left\{ \left[ (1-p)^{\ell} \right]^2 + \left[ 1 - (1-p)^{\ell} \right]^2 : \ell \in \mathbb{N} \right\}, \]

and let $\ell_0$ be a value of $\ell$ that achieves the minimum ($\ell_0$ can be assigned arbitrarily if
there are at least two such values). We will show in Section 3 that

\[ \ell_0 \in \left\{ \left\lfloor \frac{\log(1/2)}{\log(1-p)} \right\rfloor, \left\lceil \frac{\log(1/2)}{\log(1-p)} \right\rceil \right\}, \tag{2} \]

and that

\[ \frac{1}{2} \le s(p) \le \frac{1 + p^2}{2}. \tag{3} \]

If $p$ is a constant, then $r = r(p)$ is defined such that $n^2 s^{r \lg n} = 1$, that is,

\[ r = r(p) = \frac{2}{\lg(1/s)}. \tag{4} \]

FIGURE 2. The function $s = s(p)$ for $p \in (0,1)$ and $p \in (0,1/2]$, respectively.


Observe that $r$ tends to infinity as $p \to 1$ and undergoes a “zigzag” behaviour as
a function of $p$ (see Figure 1). The reason for such a behaviour is, of course, that
the function $s$ is not monotone (see Figure 2). Furthermore, observe that for each
$p = 1 - (1/2)^{1/k}$, where $k$ is a positive integer, $\ell_0 = k$, $s = 1/2$, and $r = 2$.
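
The quantities $s$, $\ell_0$ and $r$ for constant $p$ are easy to evaluate numerically. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the minimum defining $s$ is attained at the floor or the ceiling of $\log(1/2)/\log(1-p)$ (clamped to be at least 1), as asserted for $\ell_0$ in the text.

```python
import math


def pair_collision(p, ell):
    """[(1-p)^ell]^2 + [1-(1-p)^ell]^2 for a colour class of size ell."""
    a = (1.0 - p) ** ell
    return a * a + (1.0 - a) ** 2


def s_and_ell0(p):
    """Minimise over the floor/ceiling candidates for ell_0 (an assumption
    taken from the text); returns the pair (s, ell_0)."""
    x = math.log(0.5) / math.log(1.0 - p)
    candidates = {max(1, math.floor(x)), max(1, math.ceil(x))}
    ell0 = min(candidates, key=lambda ell: pair_collision(p, ell))
    return pair_collision(p, ell0), ell0


def r_constant_p(p):
    """r = 2 / lg(1/s), the solution of n^2 * s^(r lg n) = 1."""
    s, _ = s_and_ell0(p)
    return 2.0 / math.log2(1.0 / s)
```

Evaluating `r_constant_p` on a fine grid of values of $p$ reproduces the zigzag: the function returns to 2 at every $p = 1 - (1/2)^{1/k}$ and rises in between.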


Now we state the main result of the paper. 
Theorem 1.1. Suppose that $p = p(n)$ is such that

\[ p \gg \frac{(\log n)^2 (\log(np))^2}{n} \quad \text{and} \quad p \le 1 - \varepsilon, \]

for some $\varepsilon \in (0,1)$. Let $G \in \mathcal{G}(n,p)$. Then, the following holds a.a.s.
(i) If $p$ is a constant, then

\[ \chi_s(G) \sim r \lg n. \]

(ii) If $p = o(1)$ and $np = n^{\alpha + o(1)}$ for some $\alpha \in (0,1]$, then

\[ (2\alpha + o(1)) \lg n \le \chi_s(G) \le (1 + \alpha + o(1)) \lg n. \]

(iii) If $np = n^{o(1)}$, then

\[ 2(\lg(np) - \lg \log n - \lg \log(np)) \le \chi_s(G) \le (1 + o(1)) \lg n. \]


Note that the result is asymptotically tight for dense graphs (that is, for $np = n^{1-o(1)}$;
see part (i) and part (ii) for $\alpha = 1$). For sparser graphs (part (ii) for $\alpha \in (0,1)$) the
ratio between the upper and the lower bound is a constant that gets large for $\alpha$ small.
On the other hand, the trivial lower bound of $\lg \chi(G)$ (see (1)) gives us the following:
a.a.s.

\[ \chi_s(\mathcal{G}(n,p)) \ge \lg \chi(\mathcal{G}(n,p)) \sim \lg \left( \frac{pn}{2 \log(pn)} \right) \sim \alpha \lg n, \]

provided that $pn \to \infty$ as $n \to \infty$, and $p = o(1)$; $\chi_s(\mathcal{G}(n,p)) \ge \lg \chi(\mathcal{G}(n,p)) = O(1)$
otherwise (see BI7|). So the lower bound we prove is by a multiplicative factor of $2 + o(1)$
larger than the trivial one, provided that $\log(np)/\log \log n \to \infty$. If $np = \log^{C+o(1)} n$
for some $C \in [2, \infty)$, then our bound is by a factor of $2(C-1)/C + o(1)$ better than the
trivial one. This seemingly small improvement is important to obtain the asymptotic
behaviour in the case $\alpha = 1$, and in particular, to obtain the zigzag for constant $p$.


The upper and the lower bounds are proved in Section 3 and Section 4, respectively.
Let us also mention that, in fact, the two bounds proved below are slightly stronger. In
particular, the upper bound holds for $pn \ge (2/\log 2)(\log n)(\log \log n)$, the point where
the trivial bound of $\chi(\mathcal{G}(n,p))$ becomes stronger.


2. Preliminaries 

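We will use the following version of Chernoff’s bound. Suppose that $X \in \mathrm{Bin}(n,p)$
is a binomial random variable with expectation $\mu = np$. If $0 < \delta < 1$, then

\[ \mathbb{P}[X < (1 - \delta)\mu] \le \exp \left( -\frac{\delta^2 \mu}{2} \right), \]

and if $\delta > 0$,

\[ \mathbb{P}[X > (1 + \delta)\mu] \le \exp \left( -\frac{\delta^2 \mu}{2 + \delta} \right). \]

These inequalities are well known and can be found, for example, in [5].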


We will also use Suen’s inequality that was introduced in [9] and revised in [4].
For a finite set $S$, let $\mathcal{I} = \{(x, y) : x, y \in S, x \neq y\}$, and for any $(x, y) \in \mathcal{I}$, let $A_{x,y}$ be
some event with the corresponding indicator random variable $I_{x,y}$. (In our application,
$A_{x,y}$ will be the event that vertices $x$ and $y$ have the same neighbourhood colour sets.)
Let $X = \sum_{(x,y) \in \mathcal{I}} I_{x,y}$ be the random variable counting how many such events occur.
The associated dependency graph has $\mathcal{I}$ as its vertex set, and $(x_1, y_1) \sim (x_2, y_2)$ if
and only if $\{x_1, y_1\} \cap \{x_2, y_2\} \neq \emptyset$. Suen’s inequality asserts that

\[ \mathbb{P}(X = 0) \le \exp \left( -\mu + \Delta e^{2\delta} \right), \tag{5} \]

where

\[ \mu = \sum_{(x,y) \in \mathcal{I}} \mathbb{P}(A_{x,y}), \qquad \Delta = \sum_{(x_1,y_1) \sim (x_2,y_2)} \mathbb{E}[I_{x_1,y_1} I_{x_2,y_2}], \qquad \delta = \max_{(x_1,y_1) \in \mathcal{I}} \sum_{(x_2,y_2) \sim (x_1,y_1)} \mathbb{P}(A_{x_2,y_2}). \]
3. Upper bound

We start by proving (2) and (3). Since

\[ \left[ (1-p)^{\ell} \right]^2 + \left[ 1 - (1-p)^{\ell} \right]^2 = 2 \left[ (1-p)^{\ell} - 1/2 \right]^2 + 1/2, \]

it follows that $s \ge 1/2$, and consequently (2) also holds. Now, let

\[ \ell = \left\lceil \frac{\log(1/2)}{\log(1-p)} \right\rceil = \frac{\log(1/2)}{\log(1-p)} + \delta, \]

where $0 \le \delta < 1$. Observe that, since $(1-p)^{\ell} = (1-p)^{\delta}/2$,

\[ s(p) \le \left[ (1-p)^{\ell} \right]^2 + \left[ 1 - (1-p)^{\ell} \right]^2 = \frac{\left[ 1 - (1-p)^{\delta} \right]^2 + 1}{2} \le \frac{\left[ 1 - (1-p) \right]^2 + 1}{2} = \frac{p^2 + 1}{2}, \]

implying the upper bound in (3).

We keep the definition of function $r = r(p)$ for constant $p$ introduced above (see (4)).
We extend it here for sparser graphs as follows: suppose that $p$ tends to zero as $n \to \infty$,
and that $np = n^{\alpha + o(1)}$ for some $\alpha \in [0,1]$. Then, we define $r = r(p)$ such that
$n^2 p s^{r \lg n} = 1$, that is,

\[ r = r(p) \sim 1 + \alpha, \]

since it follows from (3) that $s \sim 1/2$.


The upper bound in Theorem 1.1 follows immediately from the next lemma.


Lemma 3.1. Suppose that $p = p(n)$ is such that

\[ p \ge \frac{2}{\log 2} \cdot \frac{(\log n)(\log \log n)}{n} \quad \text{and} \quad p \le 1 - \varepsilon, \]

for some fixed $\varepsilon \in (0,1)$. Let $G \in \mathcal{G}(n,p)$. Then, a.a.s. $\chi_s(G) \le (r + o(1)) \lg n$.


Before we move to the proof, let us note that the lower bound for $p$ is not necessary,
and the result can be extended to sparser graphs. The reason it is introduced here is
that for sparser graphs, the trivial upper bound of $\chi(G)$ is stronger; note that a.a.s.

\[ \chi_s(\mathcal{G}(n,p)) \le \chi(\mathcal{G}(n,p)) \sim \frac{pn}{2 \log(pn)}, \]

provided that $pn \to \infty$ as $n \to \infty$, and $p = o(1)$; $\chi_s(G) \le \chi(G) = O(1)$ otherwise.


Proof. The proof is straightforward. Let $\omega = \omega(n) = o(\log n)$ be any function tending
to infinity with $n$ (slowly enough). Before exposing the edges of the (random) graph
$G$, we partition (arbitrarily) the vertex set into $r \lg n + \omega$ sets, each consisting of $\ell_0$
important vertices, and one remaining set of vertices, these being not important. (For
expressions such as $r \lg n + \omega$ that clearly have to be an integer, we round up or down
but do not specify which: the choice of which does not affect the argument.) Note that

\[ (r \lg n + \omega) \ell_0 = O(\log n/p) = O(n/\log \log n) = o(n), \]

and so there are enough vertices to perform this operation. All vertices in a given set
receive the same colour, and hence the total number of colours is equal to $(r + o(1)) \lg n$.

For a given pair of vertices, $x, y$, we need to estimate from above the probability
$p(x, y)$ that they have the same neighbourhood colour sets. We do it by considering
sets of important vertices that neither $x$ nor $y$ belong to. Let $U$ be the set of (important)
vertices of the same colour, and let $\ell_0 = |U|$. Then, either both $x$ and $y$ are not
connected to any vertex from $U$, yielding the contribution $[(1-p)^{\ell_0}]^2$ to the probability
$p(x,y)$, or both $x$ and $y$ are connected to at least one vertex from $U$, giving the
contribution $[1 - (1-p)^{\ell_0}]^2$. Thus,

\[ p(x,y) \le \left( \left[ (1-p)^{\ell_0} \right]^2 + \left[ 1 - (1-p)^{\ell_0} \right]^2 \right)^{r \lg n + \omega - 2} = s^{r \lg n + \omega - 2}. \]

Hence, the expected number of pairs of adjacent vertices that are not distinguished by
their neighbourhood colour sets is at most

\[ \binom{n}{2} p s^{r \lg n + \omega - 2} \le \frac{n^2 p s^{r \lg n}}{2} \cdot s^{\omega - 2} = \frac{s^{\omega - 2}}{2}, \]

where the last equality follows from the definition of $r$. Finally, by (3), we get that
$s(p) \le (p^2 + 1)/2 \le ((1 - \varepsilon)^2 + 1)/2 < 1$ and so $s^{\omega - 2}/2$ tends to zero as $n \to \infty$. Hence,
the lemma follows by Markov’s inequality. □
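
The construction in this proof can be tried out on small instances. The following Python sketch is an illustration only: the function names are hypothetical, the graph sizes are far from the asymptotic regime, and the leftover vertices are simply given one extra colour 0. It samples a binomial random graph, colours blocks of equal size as in the proof, counts the edges whose endpoints share a neighbourhood colour set, and evaluates the first-moment bound $\binom{n}{2} p s^{m-2}$ for $m$ colour classes.

```python
import math
import random
from itertools import combinations


def sample_gnp(n, p, rng):
    """Adjacency sets of a binomial random graph on vertices 0..n-1."""
    adj = {v: set() for v in range(n)}
    for u, v in combinations(range(n), 2):
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)
    return adj


def block_colouring(n, num_classes, class_size):
    """Colours 1..num_classes on consecutive blocks of class_size 'important'
    vertices; colour 0 on the rest (needs num_classes * class_size <= n)."""
    colour = [0] * n
    for v in range(num_classes * class_size):
        colour[v] = 1 + v // class_size
    return colour


def undistinguished_edges(adj, colour):
    """Number of edges uv whose endpoints have equal neighbourhood colour sets."""
    csets = {v: frozenset(colour[u] for u in adj[v]) for v in adj}
    return sum(1 for u in adj for v in adj[u] if u < v and csets[u] == csets[v])


def first_moment_bound(n, p, s, num_classes):
    """binom(n, 2) * p * s^(num_classes - 2), the bound used in the proof."""
    return math.comb(n, 2) * p * s ** (num_classes - 2)
```

For instance, with $n = 200$ and $p = 1/2$ (so that $\ell_0 = 1$ and $s = 1/2$), using 24 singleton classes already makes `first_moment_bound(200, 0.5, 0.5, 24)` smaller than 0.01.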


4. Lower bound 

Before we move to the proof of the lower bound, we need the following technical 
lemma. 

Lemma 4.1. Let $1/2 \le x, y \le 1$ and $\beta_x, \beta_y$ be any positive real numbers. Then, there
exist unique $s$ and $z$ such that $1/2 \le s, z \le 1$ and

\[ (x^2 + (1-x)^2)^{\beta_x} (y^2 + (1-y)^2)^{\beta_y} = (z^2 + (1-z)^2)^{\beta_x + \beta_y} = s^{\beta_x + \beta_y}. \tag{6} \]

Moreover,

\[ \frac{(x^3 + (1-x)^3)^{\beta_x} (y^3 + (1-y)^3)^{\beta_y}}{(x^2 + (1-x)^2)^{\beta_x} (y^2 + (1-y)^2)^{\beta_y}} \le \left( \frac{z^3 + (1-z)^3}{z^2 + (1-z)^2} \right)^{\beta_x + \beta_y} = \left( \frac{3s - 1}{2s} \right)^{\beta_x + \beta_y}. \tag{7} \]

The lemma can be inductively applied to obtain the following corollary.

Corollary 4.2. Let $1/2 \le x_1, x_2, \ldots, x_k \le 1$. Then, there exist unique $s$ and $z$ such
that $1/2 \le s, z \le 1$ and

\[ \prod_{i=1}^{k} (x_i^2 + (1 - x_i)^2) = (z^2 + (1-z)^2)^k = s^k. \]

Moreover,

\[ \frac{\prod_{i=1}^{k} (x_i^3 + (1 - x_i)^3)}{\prod_{i=1}^{k} (x_i^2 + (1 - x_i)^2)} \le \left( \frac{z^3 + (1-z)^3}{z^2 + (1-z)^2} \right)^k = \left( \frac{3s - 1}{2s} \right)^k. \]
Proof of Lemma 4.1. Let $1/2 \le x, y \le 1$ and $\beta_x, \beta_y$ be fixed positive real numbers. First
we show that there exist unique numbers $z$ and $s$ satisfying (6). Since $f(t) := t^2 + (1-t)^2$
is increasing on $[1/2, 1]$, we get

\[ (f(1/2))^{\beta_x + \beta_y} \le (x^2 + (1-x)^2)^{\beta_x} (y^2 + (1-y)^2)^{\beta_y} \le (f(1))^{\beta_x + \beta_y}. \]

Clearly, $(f(t))^{\beta_x + \beta_y}$ is also increasing and continuous on $t \in [1/2, 1]$, and thus, there is
a unique real number $z$ such that $1/2 \le z \le 1$ and

\[ (x^2 + (1-x)^2)^{\beta_x} (y^2 + (1-y)^2)^{\beta_y} = (f(z))^{\beta_x + \beta_y}. \]

To finish the proof of (6), set $s = z^2 + (1-z)^2$ and observe that $1/2 \le s \le 1$, since
$1/2 \le z \le 1$.

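Now we move to the proof of (7). Let

\[ a = x^2 + (1-x)^2 \quad \text{and} \quad b = y^2 + (1-y)^2. \]

Since for every real number $t$,

\[ t^3 + (1-t)^3 = \frac{3(t^2 + (1-t)^2) - 1}{2}, \tag{8} \]

we get

\[ x^3 + (1-x)^3 = \frac{3a - 1}{2} \quad \text{and} \quad y^3 + (1-y)^3 = \frac{3b - 1}{2}. \]

Furthermore,

\[ z^2 + (1-z)^2 = (x^2 + (1-x)^2)^{\frac{\beta_x}{\beta_x + \beta_y}} (y^2 + (1-y)^2)^{\frac{\beta_y}{\beta_x + \beta_y}} = a^{\frac{\beta_x}{\beta_x + \beta_y}} b^{\frac{\beta_y}{\beta_x + \beta_y}}, \]

and so

\[ z^3 + (1-z)^3 = \frac{3 a^{\frac{\beta_x}{\beta_x + \beta_y}} b^{\frac{\beta_y}{\beta_x + \beta_y}} - 1}{2}. \]

In order to show the inequality in (7) it suffices to prove

\[ (z^3 + (1-z)^3)^{\beta_x + \beta_y} \ge (x^3 + (1-x)^3)^{\beta_x} (y^3 + (1-y)^3)^{\beta_y}, \]

which is equivalent to

\[ \left( \frac{3 a^{\frac{\beta_x}{\beta_x + \beta_y}} b^{\frac{\beta_y}{\beta_x + \beta_y}} - 1}{2} \right)^{\beta_x + \beta_y} \ge \left( \frac{3a - 1}{2} \right)^{\beta_x} \left( \frac{3b - 1}{2} \right)^{\beta_y}, \]

and subsequently to

\[ (3a)^{\frac{\beta_x}{\beta_x + \beta_y}} (3b)^{\frac{\beta_y}{\beta_x + \beta_y}} - 1 \ge (3a - 1)^{\frac{\beta_x}{\beta_x + \beta_y}} (3b - 1)^{\frac{\beta_y}{\beta_x + \beta_y}}. \tag{9} \]

Set

\[ p = \frac{\beta_x + \beta_y}{\beta_x} \quad \text{and} \quad q = \frac{\beta_x + \beta_y}{\beta_y}. \]

Now, showing (9) (and hence also (7)) is equivalent to showing

\[ (3a)^{1/p} (3b)^{1/q} \ge (3a - 1)^{1/p} (3b - 1)^{1/q} + 1. \]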
The latter inequality follows immediately from Hölder’s inequality (see, for example, PI):
indeed, let

\[ a_1 = (3a - 1)^{1/p}, \quad b_1 = (3b - 1)^{1/q} \quad \text{and} \quad a_2 = b_2 = 1. \]

(Observe that $a_1$ and $b_1$ are well-defined since $3a - 1 > 0$ and $3b - 1 > 0$.) Then, since
$1/p + 1/q = 1$, $p > 1$, and $q > 1$, Hölder’s inequality yields

\[ (a_1^p + a_2^p)^{1/p} (b_1^q + b_2^q)^{1/q} \ge a_1 b_1 + a_2 b_2, \]

as required. Finally note that the equality in (7) follows from (8) applied with $s =
z^2 + (1-z)^2$. The proof of the lemma is finished. □


As we did in the previous section, we keep the definition of the function $r = r(p)$
for constant $p$ (see (4)). We extend it here for sparser graphs as follows (in a different
way than in the previous section): suppose that $p$ tends to zero as $n \to \infty$ and that
$np = n^{\alpha + o(1)}$ for some $\alpha \in [0,1]$. This time, $r = r(p)$ is defined such that
$(np)^2 s^{r \lg n} = (\log^2 n)(\log^2 (np))$, that is,

\[ r = r(p) = \frac{2 (\lg(np) - \lg \log n - \lg \log(np))}{(\lg n)(\lg(1/s))} \ge \frac{2 (\lg(np) - \lg \log n - \lg \log(np))}{\lg n}, \]

since $s \ge 1/2$.


Now we are ready to come back to the proof of the lower bound. The lower bound
in Theorem 1.1 follows immediately from the following lemma.


Lemma 4.3. Suppose that $p = p(n)$ is such that

\[ p \gg \frac{(\log n)^2 (\log(np))^2}{n} \quad \text{and} \quad p \le 1 - \varepsilon, \]

for some fixed $\varepsilon \in (0,1)$. Let $G \in \mathcal{G}(n,p)$. Then, a.a.s. $\chi_s(G) \ge r \lg n$, provided that
$p = o(1)$, and $\chi_s(G) \ge (1 + o(1)) r \lg n$ otherwise.


Proof. First, let us note that, since the expected degree tends to infinity faster than
$(\log^2 n)(\log^2 (np))$, it follows immediately from Chernoff’s bound and the union bound
that a.a.s. all vertices have degree at most, say, $2pn$. Hence, since we aim for a statement
that holds a.a.s., we may assume that the maximum degree of $G$ is at most $2pn$. In
fact, the argument is slightly more delicate and will be explained soon.

Suppose that we are given a colouring of the vertices. We partition all colours into
important and unimportant ones: a colour is important if the number of vertices
of that colour is at most $2 \log n/p$. First, let us show that unimportant colours can
distinguish only a few edges. Formally, we claim the following:


Claim: A.a.s. each set of $2 \log n/p$ vertices dominates all but at most $2 \log n/p$ vertices.
Proof of the claim. Note that the expected number of pairs of disjoint sets of size
$2 \log n/p$ with no edge between them is at most

\[ \binom{n}{2 \log n/p}^2 (1-p)^{(2 \log n/p)^2} \le \left( \frac{nep}{2 \log n} \right)^{4 \log n/p} \exp \left( -\frac{4 \log^2 n}{p} \right) = o \left( n^{4 \log n/p} \exp \left( -\frac{4 \log^2 n}{p} \right) \right) = o(1). \]

The claim follows from the first moment method.


□ 
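
The first-moment estimate in the claim can be evaluated numerically on a logarithmic scale. The following Python sketch is an illustration under stated assumptions: the function names are hypothetical, the set size is rounded up to $\lceil 2 \log n/p \rceil$, and log-gamma is used to avoid overflow. A negative return value means that the corresponding expectation is below 1.

```python
import math


def log_binom(n, m):
    """Natural logarithm of binom(n, m), computed via log-gamma."""
    return math.lgamma(n + 1) - math.lgamma(m + 1) - math.lgamma(n - m + 1)


def log_expected_bad_pairs(n, p):
    """log of binom(n, m)^2 * (1 - p)^(m^2) with m = ceil(2 log n / p)."""
    m = math.ceil(2 * math.log(n) / p)
    return 2 * log_binom(n, m) + m * m * math.log(1 - p)
```

For example, `log_expected_bad_pairs(10**6, 0.1)` is negative, in line with the bound above.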


Hence, if $p$ is constant, then $O(\log n)$ unimportant colours can distinguish only
$O(\log^2 n) = o(n)$ vertices. All remaining vertices will have all unimportant colours
present in their neighbourhood colour sets; as a result, no edge in the graph induced
by these vertices can be distinguished by unimportant colours. On the other hand,
if $p = o(1)$, then at most $(2 + o(1)) \lg(np) \le (2 + o(1)) \lg n$ unimportant colours can
distinguish at most $6 \log^2 n/p = o(n)$ vertices, since $pn \gg \log^2 n$. As a consequence of
the claim, we may concentrate on important colours from now.


Suppose that a colouring $c : I \to [k]$ using $k$ important colours is fixed; $I \subseteq V$ is the
set of vertices coloured with important colours. Moreover, let us fix a set $U \subseteq V$ of
$O(\log^2 n/p) = o(n)$ vertices that are (possibly) distinguished by unimportant colours.
Our goal is to estimate the probability $q(c, U)$ that important colours distinguish endpoints
of edges in $G[V \setminus (I \cup U)]$, which is the graph induced by those vertices that are
coloured with unimportant colours and that are adjacent to at least one vertex from
each unimportant colour class. Since the number of configurations to investigate is at
most

\[ \binom{n}{2 \log n/p}^{O(\log np)} \binom{n}{O(\log^2 n/p)} \le (np)^{O(\log^2 n/p)} = \exp \left( O \left( \frac{(\log n)^2 (\log(np))}{p} \right) \right), \]

it is enough to estimate $q(c, U)$ from above by, say,

\[ Q = Q(p) := \exp \left( -(\log^2 n)(\log^{3/2}(np))/p \right). \tag{10} \]

The result will then follow immediately by the union bound.

The expected number of edges in $G[V \setminus (I \cup U)]$ is greater than $n^2 p/3 \gg n \log^2 n$, and
so it follows from Chernoff’s bound that with probability at most $\exp(-n \log^2 n) \le Q/2$
the number of edges is smaller than, say, $n^2 p/4$. On the other hand, if the maximum
degree in $G[V \setminus (I \cup U)]$ is larger than $2pn$ (for some configuration $(c, U)$), then we stop
the whole argument and claim no lower bound for $\chi_s(G)$. Recall that at the beginning
of the proof, we showed that a.a.s. $\Delta(G) \le 2pn$. Clearly, if this is the case, then
(deterministically) the degree of each vertex in $G[V \setminus (I \cup U)]$ is at most $2pn$. Hence,
we may condition on the event that the graph $G[V \setminus (I \cup U)]$ has the following two
properties: (i) the number of edges is at least $n^2 p/4$, and (ii) no vertex has degree more
than $2pn$. It is important that no edge between $V \setminus (I \cup U)$ and $I$ has been exposed
yet.
Let us focus on constant $p$ first. Suppose that the number of important colours is
equal to

\[ k := r \lg n + \frac{5 \log \log n}{\log s} = r \lg n - O(\log \log n) \sim r \lg n, \]

where the error term follows from (3) and from our assumption that $p \le 1 - \varepsilon$ from
which we get, as before, $s \le ((1 - \varepsilon)^2 + 1)/2 < 1$. Suppose that for a given configuration
$(c, U)$, the probability $p(x,y)$ that two adjacent vertices $x, y$ from $V \setminus (I \cup U)$ are not
distinguished by important colours is $t^k$ for some $t \ge s$. (Recall that $s^k$ is the lower
bound for $p(x,y)$ which can be attained when all colour classes have size $\ell_0$.) We will
use Suen’s inequality to obtain an upper bound for $q(c, U)$. Let

\[ \mathcal{I} = \{(x,y) : x, y \in V \setminus (I \cup U), x \neq y, xy \in E\}. \]

For any $(x, y) \in \mathcal{I}$, let $A_{x,y}$ be the event (with the corresponding indicator random
variable $I_{x,y}$) that the neighbourhood colour sets (restricted to important colours only)
of $x$ and $y$ are equal. Let $X = \sum_{(x,y) \in \mathcal{I}} I_{x,y}$. We wish to estimate the probability that
$X = 0$. Denote by $k_i$ (for $1 \le i \le k$) the number of vertices coloured by colour $i$.


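Suppose first that $t = s$. This means

\[ p(x,y) = \prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^2 + \left[ 1 - (1-p)^{k_i} \right]^2 \right) = s^k. \tag{11} \]

In this case,

\[ \mu = s^k \cdot |\mathcal{I}| \ge s^k \cdot \frac{n^2 p}{4} = s^{r \lg n + 5 \log \log n/\log s} \cdot \frac{n^2 p}{4} = \frac{p \log^5 n}{4} \gg \log^4 n, \]

where the last equality follows from the definition of $r$. Observe that for $(x, y)$ and
$(y, z)$ in $\mathcal{I}$ we get

\[ \mathbb{P}(A_{x,y} \cap A_{y,z}) = \prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^3 + \left[ 1 - (1-p)^{k_i} \right]^3 \right). \]

Thus, since the number of pairs $(x, y)$ and $(y, z)$ is bounded by $|\mathcal{I}| \cdot 4pn$, we have

\[
\begin{aligned}
\Delta &\le |\mathcal{I}| \cdot 4pn \cdot \prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^3 + \left[ 1 - (1-p)^{k_i} \right]^3 \right) \\
&= |\mathcal{I}| \cdot s^k \cdot 4pn \cdot \frac{\prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^3 + \left[ 1 - (1-p)^{k_i} \right]^3 \right)}{\prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^2 + \left[ 1 - (1-p)^{k_i} \right]^2 \right)} \\
&= \mu \cdot 4pn \cdot \frac{\prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^3 + \left[ 1 - (1-p)^{k_i} \right]^3 \right)}{\prod_{i=1}^{k} \left( \left[ (1-p)^{k_i} \right]^2 + \left[ 1 - (1-p)^{k_i} \right]^2 \right)}.
\end{aligned}
\]

Now, using (11), we apply Corollary 4.2 with $x_i = (1-p)^{k_i}$ if $(1-p)^{k_i} \ge 1/2$, and
$x_i = 1 - (1-p)^{k_i}$ otherwise, to get

\[ \Delta \le \mu \cdot 4pn \left( \frac{3s - 1}{2s} \right)^k. \]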
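Now we will prove that $\Delta \ll \mu$. Using Taylor expansion at $s = 1$, one can show that
for any $s \in [1/2, 1]$ we have

\[ \frac{3s - 1}{2 s^{3/2}} = 1 - \frac{3}{8} (1-s)^2 - \frac{5}{8} (1-s)^3 - \frac{105}{128} (1-s)^4 - \cdots \le 1 - \frac{3}{8} (1-s)^2. \]

Furthermore, since (3) together with $p \le 1 - \varepsilon$ implies

\[ s \le \frac{p^2 + 1}{2} \le \frac{p + 1}{2} \le 1 - \frac{\varepsilon}{2}, \]

we get $(3s - 1)/(2 s^{3/2}) \le 1 - 3\varepsilon^2/32$, and hence

\[
\begin{aligned}
\Delta &\le \mu \cdot (4np) s^{k/2} \left( 1 - \frac{3}{32} \varepsilon^2 \right)^k \\
&= \mu \cdot (4np) \left( s^{(r \lg n)/2} \log^{5/2} n \right) n^{-\Omega(\varepsilon^2)} \\
&= \mu \cdot (4p \log^{5/2} n) n^{-\Omega(\varepsilon^2)} \ll \mu.
\end{aligned}
\]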
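
The elementary estimate $(3s-1)/(2s^{3/2}) \le 1 - \frac{3}{8}(1-s)^2$ on $[1/2, 1]$, together with its consequence for $s \le 1 - \varepsilon/2$, can be spot-checked numerically. The sketch below is a grid check with hypothetical names, offered as a sanity test rather than a proof.

```python
def g(s):
    """(3s - 1) / (2 s^(3/2))."""
    return (3 * s - 1) / (2 * s ** 1.5)


def quadratic_bound(s):
    """1 - (3/8)(1 - s)^2, the Taylor expansion at s = 1 truncated after
    the quadratic term."""
    return 1 - 0.375 * (1 - s) ** 2


# Smallest value of quadratic_bound(s) - g(s) on a grid over [1/2, 1];
# it should be non-negative up to rounding.
grid = [0.5 + 0.5 * i / 1000 for i in range(1001)]
worst_gap = min(quadratic_bound(s) - g(s) for s in grid)
```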




Finally,

\[ \delta \le 2(2pn) s^k = 4p n^{-1} \log^5 n = o(1). \]

It follows from Suen’s inequality (see (5)) that

\[ \mathbb{P}(X = 0) \le \exp \left( -\mu + \Delta e^{2\delta} \right) = \exp(-(1 + o(1))\mu) \le Q/2, \]

and $q(c, U) \le Q/2 + \mathbb{P}(X = 0) \le Q$, as needed (see (10)).

Suppose now that $t > s$. In this case, $\mu$ is larger than before but, unfortunately, $\Delta$
grows faster than $\mu$ (as $t$ grows) and eventually becomes larger than $\mu$. In order to
avoid this undesired situation, we make the dependency graph sparser so that $\mu$ is still
of order $\log^5 n$. Let us note that if $t \ge u$, where $u$ is defined so that $(u/s)^k = pn$, then
Suen’s inequality can be avoided and we can simply use Chernoff’s bound to obtain the
desired upper bound for $\mathbb{P}(X = 0)$: indeed, it follows from the claim proved above that
there exists a matching in $G[V \setminus (I \cup U)]$ consisting of at least $n/3$ edges; otherwise, the
remaining $n - O(\log^2 n/p) - 2n/3 = n/3 - o(n) \gg \log n/p$ vertices in $V \setminus (I \cup U)$ would
form an independent set that could be split into two sets of equal size and, clearly, no
edge will be present between them, contradicting the claim. Since the events are now
independent, and the expected number of edge endpoints not distinguished is

\[ \mu = \frac{t^k n}{3} = \frac{s^k (t/s)^k n}{3} \ge \frac{s^k n^2 p}{3} = \frac{p \log^5 n}{3} \gg \log^4 n, \]

the desired bound holds, since $\mathbb{P}(X = 0) \le \exp(-\Omega(\mu))$ by Chernoff’s bound.

It remains to consider the case $s < t < u$. Our goal is to scale the degree in the
dependency graph down by a multiplicative factor of

\[ \xi = \xi(t) := (s/t)^k \ge (s/u)^k = 1/(pn). \]

Let $\mathcal{I}_\xi$ be a random subgraph of $\mathcal{I}$: each $(x,y) \in \mathcal{I}$ is independently put into $\mathcal{I}_\xi$ with
probability $\xi$. Since $|\mathcal{I}| \ge n^2 p/4$, $\mathbb{E}|\mathcal{I}_\xi| \ge \xi n^2 p/4 \ge n/4$, and thus a.a.s. $|\mathcal{I}_\xi| \ge \xi n^2 p/5$.
Moreover, since the maximum degree in the dependency graph is at most $2pn$, by
Chernoff’s bound together with a union bound over all vertices, it follows that a.a.s.
the maximum degree in a random subgraph of it is at most

\[ \max\{4 \xi pn, 10 \log n\} \le 40 \xi pn \log n. \]

Therefore, the deterministic (non-constructive) conclusion is that there exists a subgraph
of the dependency graph with at least $\xi n^2 p/5$ pairs and the maximum degree at
most $40 \xi pn \log n$. We restrict ourselves to this subgraph, stressing one more time that
no edge between $V \setminus (I \cup U)$ and $I$ is exposed yet. Now,


\[ \mu = t^k \cdot |\mathcal{I}_\xi| \ge s^k \left( \frac{t}{s} \right)^k \cdot \frac{\xi n^2 p}{5} = \frac{p \log^5 n}{5} \gg \log^4 n. \]

Moreover,

\[ \Delta \le t^k \left( \frac{3t - 1}{2t} \right)^k |\mathcal{I}_\xi| \cdot 2(40 \xi pn \log n) \le \mu \cdot (80 \xi pn \log n) \left( \frac{3t - 1}{2t} \right)^k = \mu \cdot (80 pn \log n) \left( \frac{3t - 1}{2t^2} \right)^k s^k. \]

Let $h(t) := (3t - 1)/(2t^2)$. Note that $h(1/2) = 1$, $h(t)$ is increasing on the interval
$[1/2, 2/3]$ attaining $h(2/3) = 9/8$, and then is decreasing on the interval $[2/3, 1]$ going
back to $h(1) = 1$. Hence, if $s \le 2/3$, then $(h(t)s)^k$, as a function of $t$, is maximized
for $t = 2/3$. Then, as a function of $s$, since $s \le 2/3$, $r = 2/\lg(1/s)$, and $k \sim r \lg n$,
$(h(2/3)s)^k$ is maximized for $s = 2/3$. We are back to the case $t = s$ that we already
checked. On the other hand, if $s > 2/3$, then, since $t \ge s$, $h(t)$ and therefore $\Delta$ is
maximized again for $t = s$. Hence, in both cases we have $\Delta \ll \mu$. Finally,

\[ \delta \le 2(40 \xi pn \log n) t^k = 80 pn (\log n) s^k = 80 p n^{-1} \log^6 n = o(1). \]


Hence, Suen’s inequality can be applied as before, and the proof for constant p is 
finished. 
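
The stated properties of $h(t) = (3t-1)/(2t^2)$ can be confirmed exactly with rational arithmetic. The following sketch uses illustrative names and checks the behaviour only on a grid of step $1/600$ over $[1/2, 1]$; it evaluates $h$ at the three special points and locates the grid maximum.

```python
from fractions import Fraction


def h(t):
    """h(t) = (3t - 1) / (2 t^2)."""
    return (3 * t - 1) / (2 * t * t)


half, two_thirds, one = Fraction(1, 2), Fraction(2, 3), Fraction(1)

# Grid 1/2, 1/2 + 1/600, ..., 1; the point 2/3 is the grid point with index 100.
grid = [half + Fraction(i, 600) for i in range(301)]
values = [h(t) for t in grid]
peak = max(range(len(grid)), key=lambda i: values[i])
```

Using `Fraction` avoids rounding issues, so equalities such as `h(two_thirds) == Fraction(9, 8)` can be tested directly.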


The case $p = o(1)$ can be verified exactly the same way. In fact, it is slightly easier
since $s = 1/2 + O(p^2)$, $r \le 2 + o(1)$, and we do not have to worry about an increasing
value of $s$ (and therefore, neither about an increasing value of $r$). We point out only the
adjustments of the proof. Recall that the definition of $r$ is extended to the case $p = o(1)$.
The number of colours is now $k = r \lg n$, and we have $(np)^2 s^k = (\log^2 n)(\log^2 (np))$.

For the case $t = s$, we have

\[ \mu = s^k |\mathcal{I}| \ge s^k \cdot \frac{n^2 p}{4} = \frac{(\log^2 n)(\log^2 (np))}{4p} \gg \frac{(\log^2 n)(\log^{3/2} (np))}{p}, \]
as desired for the union bound (see (10)). Since now $s = 1/2 + O(p^2)$, the argument for
$\Delta$ is much easier:

\[
\begin{aligned}
\Delta &\le s^k \left( \frac{1}{2} + O(p^2) \right)^k |\mathcal{I}| \cdot 4pn \\
&\le \mu (4np) (s + O(p^2))^k \\
&= 4pn \mu s^k \exp(O(p^2 \log n)) \\
&= 4\mu (np)^{-1} (\log^2 n)(\log^2 (np)) \exp(O(p^2 \log n)) \\
&\ll \mu,
\end{aligned}
\]

since $np \gg (\log^2 n)(\log^2 (np))$. (Note that for $p = \Omega(1/\sqrt{\log n})$ but still $p = o(1)$, we
have $\exp(O(p^2 \log n)) = n^{o(1)}$. However, this causes no problem, as $(np)^{-1} = n^{-1+o(1)}$
and so $\Delta \ll \mu$. Otherwise, that is, if $p = o(1/\sqrt{\log n})$, we have $\exp(O(p^2 \log n)) \sim 1$
and $\Delta \ll \mu$ follows easily.) Finally, as before, and again using the same lower bound
on $np$, we have

\[ \delta \le 2(2pn) s^k = 4 (np)^{-1} (\log^2 n)(\log^2 (np)) = o(1), \]

and Suen’s inequality can be applied.

Now let us consider the case $t > s$. The definition of $u$ is not affected, and for $t \ge u$
we use Chernoff’s bound since the events are independent. The only difference is that
the new value of $s^k$ has to be used to get

\[ \mu = \frac{t^k n}{3} = \frac{s^k (t/s)^k n}{3} \ge \frac{s^k n^2 p}{3} = \frac{(\log^2 n)(\log^2 (np))}{3p} \gg \frac{(\log^2 n)(\log^{3/2} (np))}{p}, \]

as desired.

For the case $s < t < u$, the definition of $\xi$ remains the same and again, after adjusting
the value of $s^k$ we get $\mu \gg (\log^2 n)(\log^{3/2} (np))/p$. The argument for $\Delta \ll \mu$ is not
affected. Finally,

\[ \delta \le 2(40 \xi pn \log n) t^k = 80 pn (\log n) s^k = 80 (pn)^{-1} (\log^3 n)(\log^2 (np)) = o(1), \]

provided that $pn \gg (\log^3 n)(\log^2 (np))$. For slightly sparser graphs, that is, when
$(\log^2 n)(\log^2 (np)) \ll pn = O((\log^3 n)(\log^2 (np)))$, observe that we only have that $\delta =
o(\log n)$, but in fact we can show a stronger bound for $\Delta$: it follows that

\[ (h(2/3) s)^k = (9/8)^{2 \lg n/\lg(3/2)} n^{-2} \log^5 n = n^{2 \lg(9/8)/\lg(3/2) - 2} \log^5 n, \]

and thus

\[ (80 pn \log n)(h(t) s)^k = O \left( n^{2 \lg(9/8)/\lg(3/2) - 2} \log^9 n \log \log n \right) = n^{-\Omega(1)}. \]

Hence, $\Delta = \mu n^{-\Omega(1)}$, and so $\Delta e^{2\delta} \le \mu n^{-\Omega(1)} n^{o(1)} = o(\mu)$, as needed for Suen’s inequality
to be useful. The proof is finished. □